Exploiting Omissive Faults in Synchronous Approximate Agreement

Authors

  • Mohammad H. Azadmanesh
  • Roger M. Kieckhafer
Abstract

In a fault-tolerant distributed system, it is often necessary for nonfaulty processes to agree on the value of a shared data item. The criterion of Approximate Agreement does not require processes to achieve exact agreement on a value; rather, they need only agree to within a predefined numerical tolerance. Approximate Agreement can be achieved through convergent voting algorithms. Previous research has studied convergent voting algorithms under mixed-mode or hybrid fault models, such as the Thambidurai and Park Hybrid fault model, which comprises three fault modes: asymmetric, symmetric, and benign. This paper makes three major contributions to the state of the art in fault-tolerant convergent voting. 1) We partition both the asymmetric and symmetric fault modes into disjoint omissive and transmissive submodes. The resulting five-mode hybrid fault model is a superset of previous hybrid fault models. 2) We present a new family of voting algorithms, called Omission Mean Subsequence Reduced (OMSR), which implicitly recognize and exploit omissive behavior in malicious faults while still maintaining full Byzantine fault tolerance. 3) We show that OMSR voting algorithms are more fault-tolerant than previous voting algorithms if any of the currently active faults is omissive.

Index Terms: Approximate agreement, clock synchronization, convergent voting algorithms, fault-tolerant distributed systems, hybrid
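To make the idea of a convergent voting step concrete, here is a minimal Python sketch of a generic trimmed-mean vote. It is an illustration only and is not the OMSR definition from the paper: the function name `trimmed_mean_vote`, the parameter `tau`, and the convention of representing an omitted message as `None` are all assumptions introduced here. The sketch drops omitted values, sorts the rest, discards the `tau` smallest and `tau` largest, and averages what remains.

```python
from typing import Optional, Sequence


def trimmed_mean_vote(received: Sequence[Optional[float]], tau: int) -> float:
    """One voting step at a single process (hypothetical helper).

    received: values heard this round; None marks an omitted message
              (an assumed convention, not taken from the paper).
    tau:      number of extreme values discarded from each end.
    """
    # Omitted messages carry no value, so they are left out of the vote.
    present = sorted(v for v in received if v is not None)
    if len(present) <= 2 * tau:
        raise ValueError("too few values remain after trimming")
    # Discard the tau smallest and tau largest values, then average the rest.
    kept = present[tau:len(present) - tau]
    return sum(kept) / len(kept)


# One omitted message and one wildly wrong value (100.0), with tau = 1.
print(trimmed_mean_vote([1.0, 2.0, None, 100.0, 3.0], tau=1))
```

In this sketch an omission shrinks the set of values being voted on instead of injecting a misleading one, which is the intuition behind treating omissive faults separately from transmissive ones. How OMSR actually adjusts its reduction for omissions is specified in the paper, not here.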

Similar resources

Using Omissive Faults to Obtain Local Convergence in Partially Connected Networks

Approximate Agreement is an important issue in fault-tolerant distributed computing, where nonfaulty processes exchange and vote upon their local values to arrive at values that are within the range of the initial values of the nonfaulty processes and within a predefined tolerance of each other. Results to date in Approximate Agreement, however, are not capable of exploiting omission faults. O...

Exploiting Markov Chains to Reach Approximate Agreement in Partially Connected Networks

Research on reaching Approximate Agreement (AA) in fully connected networks is relatively mature. In contrast, a survey of the literature on the AA problem for partially connected networks shows considerably less work. This is because a node may not have a complete view of the global network, which makes it difficult to attain the convergence properties. The complexity of t...

A Perception-based Fault Model for Single-Round Agreement Algorithms

We propose a new hybrid fault model for clock synchronization and single-round (approximate) agreement in synchronous distributed systems, which accurately captures both node and link faults. Unlike conventional “global” fault models, which rest upon the total number of faulty nodes in the system, it relies solely upon the number of faults in any two non-faulty nodes’ “perceptions”, conveyed b...

Internal Fault Detection, Location, and Classification in Stator Winding of the Synchronous Generators Based on the Terminal Voltage Waveform

In this paper, a novel method is presented for detection and classification of the faulty phase/region in the stator winding of synchronous generators on the basis of the resulting harmonic components that appear in the terminal voltage waveforms. Analytical results obtained through Decision Tree (DT) show that the internal faults are not only detectable but can also be classified and the rela...

Agreement in synchronous networks with ubiquitous faults

In this paper we are interested in synchronous distributed systems subject to transient and ubiquitous failures. This includes systems where failures can occur on any communication link, systems where every processor will at one time or another experience a send or receive failure, and so on, with normal functioning resuming a finite time after each failure. Notice that these cases cannot be ...

Journal:
  • IEEE Trans. Computers

Volume 49

Publication date: 2000